Image and Video Understanding in Big Data

نویسندگان

  • Vittorio Murino
  • Shaogang Gong
  • Chen Change Loy
  • Loris Bazzani
چکیده

h 1 The huge volume of data produced every day is posing a signifiant challenge to computer scientists since it is infeasible that this ata can be effectively processed and consistently interpreted by umans manually, even to a very small extent. Due to the automaion of many industrial processes and the advent of cheaper and igh-performance sensors, many aspects of life, including medial, commercial, industrial and security areas are increasingly more haracterized by large data collections that must also be processed nd communicated with. DNA sequencing, radiographic and imagng examinations (e.g. MRI, X Ray, echography), urban surveillance, raffic monitoring, customer profiling and e-commerce, visual inpection, industrial machine maintenance and failure prediction re some examples of the vast range of possible applications in ur everyday social and working life characterized by large data atherings. Among these, visual data takes a prominent role given the large nd pervasive diffusion of imaging devices in our cities, industries, t home, and in our hands given the plenitude of personal deices such as smart phones at our disposal. For the latter, one can ppreciate readily the scale of the data consumed just by thinkng about the amount of image and video data downloaded evry minute in social media such as Facebook, Instagram, Snapchat, mong others. Moreover, visual data is by large the most diverse nd demanding media, as compared to text for instance, growing t an unprecedented speed. This requires the design of effective ethods to manage it, by mining relevant information while disarding redundant or useless data. To that end, more scalable and obust methods are required to efficiently index, retrieve, organize, nterpret and interact with such big visual data, and this cannot be one without automatic or semi-automatic, e.g. human in the loop, ethods and processes capable of distil useful observations from a arge quantity of raw data. In this context, it is clear that big visual data analysis and unerstanding impose significant scientific and technological chalenges since one not only should have to advance methods able to racefully scale to both big and diverse data whilst being computaionally cheap, but also should need to develop processes suitable or learning from big and diverse data without incurring prohibitive osts in human and monetary resources, and of time, required y exhaustive data annotation. Moreover, efficient online learning ethods could be required to cope with data acquired over time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Approach to Background Subtraction Using Visual Saliency Map

Generally human vision system searches for salient regions and movements in video scenes to lessen the search space and effort. Using visual saliency map for modelling gives important information for understanding in many applications. In this paper we present a simple method with low computation load using visual saliency map for background subtraction in video stream. The proposed technique i...

متن کامل

Two Novel Chaos-Based Algorithms for Image and Video Watermarking

In this paper we introduce two innovative image and video watermarking algorithms. The paper’s main emphasis is on the use of chaotic maps to boost the algorithms’ security and resistance against attacks. By encrypting the watermark information in a one dimensional chaotic map, we make the extraction of watermark for potential attackers very hard. In another approach, we select embedding po...

متن کامل

Extending SAR Image Despckling methods for ViSAR Denoising

Synthetic Aperture Radar (SAR) is widely used in different weather conditions for various applications such as mapping, remote sensing, urban, civil and military monitoring. Recently, a new radar sensor called Video SAR (ViSAR) has been developed to capture sequential frames from moving objects for environmental monitoring applications. Same as SAR images, the major problem of ViSAR is the pres...

متن کامل

SIDF: A Novel Framework for Accurate Surgical Instrument Detection in Laparoscopic Video Frames

Background and Objectives: Identification of surgical instruments in laparoscopic video images has several biomedical applications. While several methods have been proposed for accurate detection of surgical instruments, the accuracy of these methods is still challenged high complexity of the laparoscopic video images. This paper introduces a Surgical Instrument Detection Framework (SIDF) for a...

متن کامل

Video-based face recognition in color space by graph-based discriminant analysis

Video-based face recognition has attracted significant attention in many applications such as media technology, network security, human-machine interfaces, and automatic access control system in the past decade. The usual way for face recognition is based upon the grayscale image produced by combining the three color component images. In this work, we consider grayscale image as well as color s...

متن کامل

پنهان نگاری وفقی براساس پیچیدگی نسبی پیکسلها در تصاویر دوسطح

In recent years steganography has attracted the attention of many researchers. Steganography can be used in media such as still image, audio, and video. Image is one of the most frequently used media. One of the forms of presentation of an image is halftone. In this paper a method is presented which hides data in halftone images. The data hiding process is performed during the halftoning step. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Vision and Image Understanding

دوره 156  شماره 

صفحات  -

تاریخ انتشار 2017